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1 .  Introduction 


Detection  of  stochastic  signals  in  non-white  Gaussian  noise  can  be 
reasonably  characterized  as  the  fundamental  problem  in  sonar  signal  detection. 
It  also  arises  in  many  other  applications;  for  example,  in  radar  problems 
involving  clutter,  a  Gaussian  model  is  frequently  used  to  model  the  clutter. 
The  typical  problem  arises  in  continuous  time  with  observation  over  some  time 
interval,  although  modern  signal-processing  capabilities  usually  result  in  a 
discrete-time  detection  algorithm.  The  desired  solution  is  an  algorithm  based 
on  the  likelihood  ratio,  which  is  of  course  the  optimum  operation  under 
various  criteria.  In  particular,  any  strictly  monotone  increasing  function  of 
the  likelihood  ratio  provides  a  statistical  test  satisfying  the  Neyman-Pearson 
criterion:  maximization  of  probability  of  detection  (Pp)  for  any  fixed  value 
of  the  probability  of  false  alarm  (Pp-^)-  Likelihood-ratio  solutions  have  been 
known  for  some  special  cases,  as  discussed  in  the  next  section  of  this  paper. 
However,  these  special  cases  do  not  anply  to  a  large  proportion  of  the 
problems  encountered  in  applications. 

For  discrete-time  finite-sample-size  data,  the  likelihood  ratio  is  easy 
to  construct;  it  is  simply  the  ratio  of  the  (multivariate)  S+N  probability 
density  function  to  the  multivariate  N  (noise)  probability  density  function. 

Of  course,  this  requires  not  only  that  both  density  functions  exist,  but  also 
(for  implementation)  that  they  be  known.  "Ah,  there’s  the  rub;"  these  density 
functions  are  typically  not  known  in  sonar. 

This  paper  presents  new  algorithms  for  this  problem.  They  are  based  on 
recent  and  very  general  mathematical  solutions  to  the  continuous- time  problem, 
and  are  obtained  in  discrete-time  form  as  approximations  to  the  exact 
continuous- time  likelihood  ratio.  They  are  in  functional  form.  Their 
optimality  has  the  unusual  property  of  not  requiring  any  prior  knowledge  of 
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the  signal-plus-noise  (S+N)  statistical  properties;  instead,  it  is  required 
that  one  know  the  drift  function  of  a  diffusion.  The  algorithms  can  be  used 
for  passive  or  active  detection. 

The  mathematical  solution  to  the  problem  of  discriminating  between  a 
Gaussian  process  (noise)  and  a  second  stochastic  process  of  a  general  type  is 
given  in  [1],  under  very  mild  assumptions  on  the  noise  process.  The  results 
Include  conditions  for  existence  of  the  likelihood  ratio,  and  expressions  for 
the  likelihood  ratio  when  it  exists.  The  analysis  in  [1]  is  carried  out  for 
the  continuous-time  problem;  the  results  for  the  likelihood  ratio  require 
knowledge  that  is  not  generally  available  in  applications.  Thus,  effective 
employment  of  these  results  in  operational  systems  requires  that  a  discrete¬ 
time  approximation  be  developed,  and  that  such  an  approximation  be  given  in 
terms  of  quantities  that  can  reasonably  be  expected  to  be  available  from 
either  prior  knowledge  or  estimation  from  observed  data. 

A  first  approximation  to  the  general  solution  is  given  in  [2].  However, 
it  has  the  disadvantage  that  it  requires  determination  of  the  eigenvectors  and 
associated  eigenvalues  of  the  noise  matrix,  a  problem  that  can  be  extremely 
difficult  when  using  data  containing  many  sample  points,  and  which  does  not 
lend  itself  to  a  recursive  formulation.  An  implementation  was  given  in  [3] 
which  does  not  suffer  these  disadvantages.  However,  that  paper  did  not  give 
an  adaptive  formulation,  nor  a  formulation  based  on  time-varying  properties  of 
the  S+N  process.  Neither  [2]  nor  [3]  contains  a  rigorous  justification  of  the 
approximations  given:  in  particular,  both  omit  the  description  of  key  points 
that  are  crucial  for  the  justification  of  the  algorithms.  Conversely,  the 
results  for  the  continuous-time  problem  are  stated  in  mathematical  language 
which  is  likely  to  be  unfamiliar  to  many  readers  interested  primarily  in 
applications. 
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All  of  these  omissions  are  remedied  in  the  present  paper.  Two  algorithms 
are  presented;  one  is  fully  adaptive  to  the  S+N  process;  the  other  requires 
prior  knowledge  of  a  diffusion  drift  function  which  (after  filtering) 
represents  the  signal  process.  Both  algorithms  require  knowledge  of  the  noise 
covariance  matrix,  but  not  of  its  eigenvalues  or  eigenvectors.  It  will  be 
assumed  throughout  that  the  noise  covariance  matrix  is  known  and  that  the 
noise  has  zero  mean. 

The  development  here  is  aimed  toward  readers  who  are  interested  in  the 
actual  development  of  detection  algorithms.  Given  the  theoretical  basis  of 
the  algorithms,  the  adaptive  feature  of  one.  and  the  many  alternatives 
available  for  their  implementation,  they  appear  to  provide  a  practical 
solution  to  this  important  class  of  detection  problems,  with  potential 
performance  definitely  superior  to  that  of  detection  methods  currently  in  use 
when  the  signal -plus-noise  process  is  braodband.  In  order  that  their 
potential  be  fully  developed,  it  is  important  that  it  be  appreciated  by  those 
who  actually  develop  surveillance  systems.  Thus,  we  have  minimized  the 
mathematical  jargon  so  far  as  possible,  have  replaced  theorems  by  descriptions 
and  interpretations  of  results,  and  have  emphasized  the  applications  aspects 
of  the  results  instead  of  the  mathematical  side.  However,  only  so  much  can  be 
done  in  this  direction;  we  hope  that  those  strongly  interested  in  the  problem 
will  not  be  deterred  by  some  of  the  unavoidable  complexities.  A  more 
mathematical  development  of  the  results  obtained  here,  along  with  additional 
results,  is  contained  in  [22]. 

In  addition  to  the  results  described  here,  we  have  obtained  similar 
results  for  detection  when  the  noise  is  a  Gaussian  mixture  (spherically 
invariant  process);  such  processes  represent,  for  example.  Gaussian  processes 
having  covariance  known  except  for  a  scale  factor.  Those  results  will  be 
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published  elsewhere;  partial  results  are  contained  in  [3]  and  [22]. 

The  results  given  here  have  already  been  obtained  from  us  on  a  privileged 
basis  by  some  workers  in  signal  detection,  and  have  subsequently  been  included 
by  at  least  one  for-profit  company  in  successful  proposals  for  Navy  funding  on 
development  of  detection  algorithms  for  this  problem.  One  of  the  objectives 
of  preparing  this  paper  is  to  make  the  algorithms  generally  available  to 
everyone  involved  in  detection  of  nonGaussian  (or  Gaussian)  signals  in 
Gaussian  noise.  We  now  have  substantial  extensions  of  these  algorithms,  to  a 
wide  class  of  nonGaussian  noise  processes;  the  algorithms  given  here  are  the 
basic  building  blocks  for  those  algorithms. 

2.  Previous  Results  on  Detection  of  Stochastic  Signals 

The  problem  of  detecting  a  general  signal  in  Gaussian  noise  has  been  of 
acknowledged  importance  for  many  years.  The  major  special  cases  fo.  which  a 
solution  has  been  obtained  are  listed  below.  Here,  and  throughout  the  paper, 
by  a  likelihood  ratio  we  mean  a  test  statistic  that  is  a  monotone  function  of 
the  likelihood  ratio.  The  following  cases  represent  those  for  which  a 
likelihood  ratio  or  generalized  likelihood  ratio  [4],  [5]  are  known. 

(1)  The  signal  is  a  known  waveform  (for  which  the  classical  matched  filter  is 
a  likelihood  ratio),  or  a  waveform  known  except  for  a  fixed  set  of 
parameters  (e.g. ,  amplitude,  or  phase,  or  time  of  arrival)  whose 
statistical  distribution  is  known  or  can  be  assumed. 

(2)  A  Gaussian  signal -plus-noise  process. 

(3)  A  Wiener  process  as  the  noise. 

(4)  Independent  signal  and  noise,  with  the  signal  paths  being  comparatively 
smooth  compared  to  those  of  the  noise. 
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Each  of  these  problem  classes  has  its  drawbacks  as  a  model  for 
applications  in  sonar.  The  first  category  of  problems  is  not  sufficiently 
rich.  The  second  [6],  [7]  is  a  class  that  is  often  valid,  and  whose 
Implementation  is  fairly  simple,  requiring  only  quadratic-plus- linear 
operations  on  the  data.  It  requires  knowledge  of  the  mean  vectors  and 
covariance  matrix  of  both  the  noise  (N)  and  the  signal-plus-noise  (S+N) 
processes.  These  parameters  are  generally  difficult  to  obtain  in  advance  for 
the  S+N;  another  important  drawback  is  that  the  model  itself  is  not  valid  in 
many  situations;  even  in  cases  where  it  may  be  valid,  the  S+N  process  can  vary 
over  time  to  having  strongly  nonGaussian  properties,  and  an  algorithm  limited 
to  the  assumption  of  Gaussian  S+N  then  does  not  have  the  flexibility  of 
optimal  operation. 

Category  (3)  [8],  [9].  [10]  has  a  very  attractive  formulation  for  the 
likelihood  ratio,  which  has  led  to  some  questionable  utilization.  In  fact, 
however,  it  is  not  suitable  as  model  for  most  applications  in  sonar.  First, 
the  sample  paths  of  the  Wiener  process  are  far  too  irregular  to  give  a 
reasonable  model  of  ocean  noise,  being  non-dif ferentiable  at  all  sample 
points.  Second,  the  probabilistic  properties  of  the  Wiener  process  are  not 
found  in  most  applications,  as  they  include  those  of  being  Markovian  and  a 
martingale. 

Conditions  for  existence  of  the  likelihood  ratio  for  problems  fitting  the 
model  of  (4)  have  been  given  in  [9]  and  [11].  An  expression  for  the 
likelihood  ratio  is  included  in  [11].  However,  this  model  is  also  not 
generally  practical;  first,  many  important  applications  do  not  have 
independent  signal  and  noise;  second,  the  expression  for  the  likelihood  ratio 
as  given  in  [11]  requires  that  one  know  the  statistical  distributions  of  the 
signal,  which  are  not  likely  to  be  available. 
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The  difficulty  of  the  general  problem  has  led  to  the  use  of  suboptimum 
methods;  that  is,  methods  based  on  criteria  other  than  Neyman- Pear son  or 
minimum  Bayes  risk.  Probably  the  most  widely-used  such  method  assumes 
quadratic-plus-linear  operations,  with  the  choice  of  the  operations  based  on 
the  deflection  criterion.  This  dates  back  to  at  least  the  early  1940’s  [12], 
although  a  general  solution  to  the  problem  was  obtained  only  in  1969  [13]  (see 
also  [14]).  Unfortunately,  as  shown  in  [13],  the  optimum  operation  from  the 
deflection  criterion  is  not  a  likelihood  ratio  test  statistic  even  when  the 
S+N  process  is  Gaussian.  Nevertheless,  the  simplicity  of  the  mathematical 
model  and  the  generality  of  the  solution  has  led  to  many  papers  being  written 
on  various  modifications  of  the  deflection. 

3.  Objectives  and  Summary  of  Approach 

Before  moving  to  the  technical  content  of  the  paper,  we  summarize  our 
objectives  in  this  work.  We  have  sought  to  obtain  a  detection  algorithm,  or 
algorithms,  meeting  the  following  criteria. 

(1)  Likelihood-ratio  based. 

(2)  Information-preserving.  The  solution  should  begin  with  analysis  of  the 
original  continuous-time  problem.  The  likelihood-ratio  for  the 
continuous-time  problem  should  then  be  approximated  as  well  as  possible 
in  forming  a  discrete-time  algorithm.  Independent  sampling,  which 
typically  destroys  information,  should  not  be  used  unless  absolutely 
necessary. 

(3)  Implemen table.  An  arcane  theorem  on  the  form  of  the  likelihood  ratio  is 
not  our  goal;  although  such  theorems  can  be  expected  to  form  part  of  the 
process  of  obtaining  a  practical  algorithm,  they  must  be  developed  into 
expressions  that  can  reasonably  be  expected  to  be  implemented.  Thus, 
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they  must  not  require  information  that  is  not  reasonably  accessible  in 
applications. 

(4)  Adaptive.  A  version  of  the  algorithm  should  be  adaptive,  particularly  to 
the  S+N  process. 

As  will  be  seen,  all  four  of  these  goals  have  been  met.  Two  algorithms 
will  be  given;  one  is  fully  adaptive  to  the  S+N  process;  the  second  requires 
prior  knowledge  of  the  S+N  process,  but  has  more  flexibility  in  its 
implementation  when  that  information  is  available.  Neither  algorithm  requires 
the  S+N  process  to  have  any  particular  statistical  properties.  The  S+N 
process  cam  have  any  statistical  properties,  with  Gaussian  or  nonGaussian 
distributions. 

In  keeping  with  the  above  objectives,  a  general  solution  to  the 
continuous-time  problem  was  first  obtained.  This  is  described  in  the 
following  section.  The  likelihood  ratio  obtained  in  that  work  was  then 
approximated  to  obtain  detection  algorithms;  this  is  described  in  the  second 
section  below. 

The  approach  taken  can  be  summarized  as  follows.  First,  we  obtain  a  set 
of  necessary  conditions  and  a  set  of  sufficient  conditions  in  order  that  the 
likelihood  ratio  exist  between  some  process  Y  and  our  noise  process  N.  for  the 
original  continuous-time  problem.  This  is  a  likelihood  ratio  for 
probabilities  on  the  space  of  observations,  which  we  take  to  be  LgCO.l].  The 
sufficient  conditions  are  satisfied  in  typical  sonar/radar  problems,  and. 
assuming  these  conditions,  we  obtain  an  expression  for  the  likelihood  ratio. 
This  mathematical  expression  for  the  continuous-time  problem  will  not,  in 
general,  be  lmplemen table;  there  will  not  be  sufficient  information  available 
in  applications.  A  basic  problem  is  that  the  original  representation  (see 
equation  (1)  below)  of  the  noise  process  will  typically  not  be  known  and 
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cannot  be  determined  in  its  original  generality.  Of  course,  if  one  has  a 
mathematical  model  in  which  the  representation  is  known,  then  this  is  not  the 
case.  However,  here  we  are  assuming  the  usual  situation  in  practice:  no  valid 
mathematical  model  giving  the  representation  (1)  will  be  availble.  Thus,  we 
make  an  approximation  to  this  representation  (justifying  the  approximation) 
and  then  use  the  necessary  conditions  for  existence  of  the  likelihood  ratio  to 
obtain  an  expression  for  the  likelihood  ratio  which  is  based  on  the  S+N 
process  having  a  representation  as  a  filtered  diffusion.  This  representation 
is  justified.  Finally,  using  this  representation,  we  approximate  the 
continuous-time  likelihood  ratio  to  obtain  a  discrete-time  detection 
algorithm.  The  estimation  of  the  drift  function  of  the  underlying  diffusion 
then  becomes  the  basic  problem  in  implementing  the  detection  algorithm;  there 
are  various  ways  one  may  approach  this  problem. 

4.  Solution  of  the  Continuous-Time  Problem 

Summary.  We  assume  a  mean-square  continuous  Gaussian  process  N  =  (N(t), 
t  in  [0,1]}  such  that  N(0)  =  0  with  probability  one.  The  only  other 
assumption  is  that  N  have  finite  "spectral  multiplicity"  (also  called 
Cram6r-Hida  multiplicity;  see  [15,  16]).  Many  processes  have  finite 
multiplicity,  and  any  m.s.  continuous  Gaussian  process  can  be 
approximated  by  a  finite-multiplicity  (in  fact,  multiplicity-one) 
process.  We  obtain  conditions  for  existence  of  the  likelihood  ratio 
between  N  and  a  second  process  Y.  Y  can  be  thought  of  as  signal-plus- 
noise.  Then,  under  the  assumption  that  the  likelihood  ratio  exists,  we 
obtain  an  expression  for  it.  In  the  process,  we  also  show  that  Y  must 

M  K  M 

have  a  representation  as  Y  =  S  +N  ,  where  N  is  a  noise  with  the  same 
family  of  finite-dimensional  distributions  as  N.  but  with  the  important 
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If 

difference  that  N  is  a  function  of  Y.  This  is  crucial  in  justifying  the 
final  result  for  the  likelihood  ratio.  Next  we  show  that  Y  =  (Y(t), 
t  €  [0.1]}  must  (if  the  likelihood  ratio  exists)  have  an  approximation  as 
a  filtered  diffusion.  The  filter  and  the  drift  function  of  the  diffusion 
then  characterize  the  Y  (S+N)  process. 


This  section  contains  a  summary  of  the  solution  to  the  continuous-time 
problem  for  deciding  whether  an  observed  sample  function  is  due  only  to  noise 
(a  Gaussian  process  of  known  covariance)  or  to  another  process.  This  second 
process  may  be  Gaussian  or  nonGaussian;  no  assumptions  are  made  on  its 
statistical  properties.  The  mathematical  model  will  first  be  described;  next, 
we  give  the  solution  to  the  problem  of  obtaining  conditions  for  existence  of 
the  likelihood  ratio;  finally,  expressions  for  the  likelihood  ratio  are  given. 
We  do  not  aim  for  precision  and  generality  in  this  discussion.  Instead,  we 
aim  to  state  the  results  in  such  a  way  that  they  will  be  accessible  for  a 
reader  not  having  background  in  the  esoteric  mathematics  of  stochastic 
calculus  and  measures  on  linear  spaces.  A  precise  and  complete  description  of 
the  mathematical  problem,  a  precise  theorem- format  statement  of  the  results 
described  here,  and  complete  proofs,  are  contained  in  [1]. 

The  noise  is  assumed  to  be  a  mean-square  continuous  Gaussian  process. 

The  observation  interval  is  [0,1],  although  any  finite  interval  can  be  used. 

We  assume  that  the  noise  is  "purely  nondeterministic"  [15],  [16];  in  essence, 
that  the  noise  at  time  zero  has  the  value  zero  with  probability  one.  Under 
these  assumptions,  it  is  known  [15],  [16]  that  the  noise  has  a  representation 
of  the  following  form: 


N 


t 


M 

2 

i=l 


t.sJdB^s). 


(1) 
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For  each  of  the  terms  appearing  in  the  above  sum,  the  function  is  a 
deterministic  function.  The  processes  are  Gaussian  processes  that  are 
mutually  independent  and  have  independent  increments;  they  can  also  be  taken 
to  be  mean-square  continuous.  The  variance  of  the  process,  which  we  denote 
by  /Jj ,  is  non-decreasing  on  [0,1]  and  so  defines  an  ordinary  measure  on  [0.1] 
in  the  usual  way:  the  measure  of  an  interval  (a.b)  is  equal  to  0j(b)  -  0j(a). 
Let  l^^i^  denote  the  space  of  all  functions  on  [0.1]  that  are  square- 
integrable  with  respect  to  the  measure  defined  by  f3 .  Then,  for  each  fixed  i 
and  t.  Fj(t,#)  belongs  to  ^[Pj]-  The  integrals  in  (1).  expressed  in  terms  of 
the  stochastic  processes  B^.  are  described  in  many  texts:  see,  for  example. 
[17]  or  [18].  Essentially,  they  can  be  regarded  as  Stielt jes-type  integrals. 

The  above  representation  is  due  to  Hida  [15]  and  Cramer  [16].  The  number 
M  can  be  infinite;  it  is  the  multiplicity  of  the  process.  One  additional 
assumption  to  be  made  here  is  that  this  multiplicity  is  finite.  This  does  not 
seem  to  be  a  serious  restriction;  it  is  known  that  processes  of  multiplicity 
one  are  dense,  in  a  mean-square  sense,  among  all  processes  having  a 
representation  of  the  above  form  [19]. 

The  first  question  that  must  be  considered  is  that  of  obtaining 

conditions  for  existence  of  the  likelihood  ratio.  In  the  case  where  the 

» 

signal-plus-noise  is  Gaussian,  it  is  known  [6],  [7]  that  conditions  can  be 
given  that  are  necessary  and  sufficient.  However,  if  a  general  nonGaussian 
S+N  process  is  to  be  admitted,  one  obtains  a  set  of  necessary  conditions  and  a 
separate  set  of  sufficient  conditions;  at  first  glance,  similar;  on  close 
examination,  quite  different. 

We  now  give  a  descriptive  summary  of  these  conditions  for  existence  of 
the  likelihood  ratio.  They  will  be  stated  in  a  rather  imprecise  mathematical 
form  and  all  the  aspects  will  not  be  given;  our  objective  here  is  to  bring  out 
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the  main  points  and  to  illustrate  the  importance  of  some  aspects  of  these 
conditions.  Readers  whose  primary  interest  is  in  development  of  the 
algorithms  nay  nevertheless  find  a  careful  reading  and  mulling-over  of  these 
conditions  to  be  valuable  in  unuerstanding  how  the  subsequent  detection 
algorithms  follow  from  the  complicated  mathematical  results  of  [1].  Many  ad 
hoc  detection  algorithms  have  been  proposed  for  problems  such  as  the  one 
described  here,  and  such  algorithms  are  sometimes  useful  for  purposes  other 
than  production  of  papers.  However,  the  algorithms  to  be  given  in  the  next 
section  are  not  of  this  genre;  they  have  their  genesis  in  the  conditions  for 
existence  of  the  continuous- time  likelihood  ratio. 

The  necessary  conditions  are  given  in  Theorem  2  of  [1].  Here,  we  shall 
paraphrase  that  result  in  descriptive  terms.  First,  let  V  be  any  stochastic 
process  independent  of  N.  Let  Y  be  any  process  defined  on  [0,1].  We  want 
conditions  that  are  necessary  for  the  likelihood  ratio  to  exist  for 
discrimination  between  Y  and  N.  Suppose  that  Y  is  adapted,  to  the  combined 
processes  V  and  N;  in  essence,  this  means  that  the  Y  process  at  time  t  can  be 
obtained  as  a  function  of  the  V  and  N  processes,  up  to  time  t.  Then,  if  the 
likelihood  ratio  exists,  Y  must  have  a  representation  of  the  form 


Y  =  S*  +  N*  (with  probability  one) 

for  all  fixed  t  in  [0,1].  The  process  N  has  the  same  family  of  finite¬ 
dimensional  distributions  as  the  original  noise  process,  and  has  a  Cram^r-Hida 
representation  of  the  same  form: 


*  H  ft  * 

N  =  2  F  (t.s)dB  (s) 
1  i=l  J0  1 


(2) 


where  the  processes  each  have  the  same  law  as  the  original  process,  and 
the  Fj  functions  are  those  that  appear  in  the  representation  (1)  for  the  N 
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process.  However,  the  N  process  has  the  property  (not  generally  shared  by 
the  N  process)  that  It  can  be  written  as  a  functional  of  the  Y  process. 
Moreover,  the  signal  process  (S)  must  have  its  sample  paths  (with  probability 
one)  belonging  to  the  reproducing  kernel  Hilbert  space  (RKHS;  see.  for 
example.  [17]  or  [20])  of  the  noise  covariance  function,  and  must  also  be 
adapted  to  the  Y  process.  The  RKHS  condition  can  be  Interpreted  as  follows:  A 
function  f  is  in  the  RKHS  of  N  if  and  only  if  the  likelihood  ratio  exists 
between  N  and  N+f.  In  essence,  the  RKHS  condition  means  that  f  is  sufficiently 
smooth  (smoother)  as  compared  to  the  sample  functions  of  the  noise. 

A  very  Important  part  of  the  above  statement  is  that  existence  of  the 
likelihood  ratio  implies  that  the  representation  of  (Yt)  as  given  above  is 
with  respect  to  a  process  (Nt)  which  has  the  same  distributions  as  (Nt)  but 
has  the  property  that  it  can  be  represented  as  a  functional  of  (Y^),  which  is 
not  the  case  for  the  original  (Nt).  In  the  next  section,  the  crucial 
importance  of  this  fact  will  be  more  apparent;  we  defer  a  discussion  until 
that  point. 

In  the  above  necessary  condition  for  existence  of  the  likelihood  ratio, 
we  have  introduced  an  independent  process  V.  In  sonar  applications,  this  is 

immediately  interpreted.  For  active  sonar,  reflection  from  a  target  with 

1 

random  reflecting  surfaces  would  be  a  function  of  a  process  (the  random 
reflecting  surfaces)  that  is  essentially  independent  of  the  scatterers  that 
produce  reverberation.  Thus,  the  received  signal  Y  would  depend  on  the  noise 
process  (reverberation  plus  receiver  noise  )N  and  on  the  target  reflection 
process  V.  In  the  case  of  passive  detection,  the  same  rationale  holds;  this 
is  particularly  apparent  for  broadband  signal  sources.  Thus,  the  initial 
assumption  that  (Y()  be  a  functional  of  V  and  N  has  clear  physical  meaning. 

We  now  turn  to  a  summary  of  the  sufficient  conditions  for  existence  of 
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the  likelihood  ratio  as  given  in  Theorem  3  of  [1].  As  the  above  discussion 
shows,  we  can  begin  by  assuming  that  (Y^)  is  a  functional  of  (N t )  and  a 
process  (Vt)  that  is  independent  of  (Nt).  Under  this  assumption,  the 
likelihood  ratio  dPy/dP^  will  exist  if  Y  has  a  representation  as 

Y  =  St  +  Nt  (with  probability  one) 

for  each  fixed  t  in  [0,1],  where  S  is  a  process  having  all  paths  (with 
probability  one)  in  the  RKHS  of  the  noise  covariance  function,  and  such  that  S 
is  adapted  to  the  combined  V  and  N  processes. 

One  may  note  the  difference  between  the  necessary  condition  and  the- 
sufficient  conditions.  In  both  cases,  Y  has  a  representation  as  the  sum  of  a 
signal  process  and  a  noise  process.  In  the  representation  for  Y  under  the 
sufficient  conditions,  the  noise  process  is  the  original  noise.  In  the 
necessary  conditions,  the  noise  process  (N  )  has  the  same  f ini te-dimensi Dnal 
distributions  as  the  original  noise  process  (N),  but  is  a  functional  of  the  Y 
process.  The  S  process  has  similar  representations  in  both  sets  of 
conditions;  in  particular,  in  both  cases  it  has  sample  paths  belonging  (with 
probability  one)  to  the  RKHS  of  the  noise  covariance  function.  However,  in 
the  necessary  conditions,  it  is  seen  that  the  process  (St)  must  be  adapted  to 
the  Y  process  (which  in  turn  is  adapted  to  the  combined  V  and  N  processes), 
while  in  the  sufficient  conditions  (St)  must  only  be  adapted  to  the  combined  V 
and  N  processes.  Thus,  the  properties  of  the  process  S  appearing  in  the 

M  M 

necessary  conditions  representation  (Y  =  S  +  N  )  are  actually  stronger  than 
the  properties  of  the  process  S  appearing  in  the  sufficient  conditions 
representation  (Y  =  S  +  N). 

The  sufficient  conditions  for  existence  of  the  likelihood  ratio  are  those 
which  one  might  expect  to  be  satisfied  in  the  usual  sonar  or  radar  problem. 
That  is,  the  problem  is  usually  one  containing  signal  and  additive  noise;  the 
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signal  process  will  typically  be  a  functional  of  the  noise  process  and  some 
independent  message  process,  as  previously  described.  Thus,  the  assumption 
that  the  sufficient  conditions  are  satisfied  is  quite  tenable. 

The  necessary  conditions  and  the  sufficient  conditions  are  quite 
Important.  The  combination  of  the  sufficient  conditions  and  the  known 
physical  properties  of  typical  sonar  detection  problems  permit  one  to 
determine  a  likelihood  ratio.  Given  the  form  of  that  likelihood  ratio,  the 
necessary  conditions  are  then  used  to  Justify  a  functional  representation  of 
the  S+N  process  (Y  in  the  alove)  which  leads  to  a  representation  of  the  S+N 
process  as  a  filtered  diffusion.  This  will  be  carried  out  in  the  following 
section  However,  we  first  give  an  expression  for  the  continuous-time 
likelihood  ratio. 

Referring  to  the  original  representation  (1)  of  the  noise  process,  one 
shows  the  existence  of  a  function  m  defined  as  follows.  The  range  of  m  is 
contained  in  the  space  of  vector  functions  of  M  components,  where  each 
component  is  a  function  on  [0.1]  which  is  continuous  and  vanishes  at  zero. 

The  domain  of  m  is  in  L2[0.1].  m  is  not  defined  everywhere  in  LgfO.l],  but  on 
a  dense  subset  with  probability  one.  Essentially,  m  can  be  Interpreted  as 
follows.  Let  $  be  the  map  which  takes  the  vector  process  (Bj,...  •V  into  the 
noise  process  N  (see  equation  (1)).  This  map,  which  has  its  domain  in  the 
space  of  vector-valued  M-component  continuous  functions  vanishing  at  zero  and 
its  range  in  L^CO.l],  is  also  defined  in  a  probability  one  sense.  Then 
m[#](x)  =  x  with  probability  one  (P^)  for  x  an  M-vector  of  continuous  func¬ 
tions  vanishing  at  zero.  Thus,  roughly,  one  can  view  m  as  being  the  inverse  of 
the  "function”  which  maps  the  vector  process  (Bj . B^)  into  the  process  N. 

The  sufficient  conditions  described  above  are  reasonable,  given  the 
physical  nature  of  the  detection  problem.  We  assume  they  are  satisfied.  They 
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show  that  we  can  express  S+N  (or  Y,  in  the  above)  in  the  form 


Yt  =  [FdZ]t 


N 

2 

i=l 


t, s)dZj(s) 


where  Z  is  a  vector  process  with  M  components,  each  having  continuous  sample 
paths,  and  with  i**1  component  having  the  form 


Z^t) 


VjfsJdp^s) 


+  B1(t) 


where  (v^).  i  £  M,  are  stochastic  processes,  with  v^  havings  its  sample  paths 
(w.p.  1)  in  LgCPj].  Now.  it  can  be  shown  that  the  likelihood  ratio  must  exist 
between  the  two  vector  processes  Z  and  B;  write  this  likelihood  ratio  as 
dP„/dPn .  This  is  a  likelihood  ratio  on  the  space  of  M-component  vector 
functions  where  each  component  is  a  continuous  function  on  [0,1]  that  vanishes 
at  zero.  Then,  the  likelihood  ratio  dP^g+Nj/dPN  has  the  form  (when  a  sample 
function  is  observed) 


dP, 


S+N 


dP„ 


dP, 


(x)  = 


N 


dP, 


B 


(m[x]) 


(with  probability  one  dPg). 


(3) 


We  now  come  to  a  very  important  point.  The  likelihood  ratio  given  above 
is  based  on  the  assumption  that  the  sufficient  conditions  are  satisfied. 
However,  these  conditions  assume  a  representation  of  Y  =  S  +  N.  where  N  is  the 
actual  noise  process.  Our  eventual  algorithm  will  be  derived  by  first 
assuming  that  the  noise  process  N  has  multiplicity  M  =  1  in  equation  (1),  and 
then  representing  the  process  Z  above  (in  multiplicity  one)  as  a  process  of 
"diffusion  type,"  having  the  form 


and 


u  £  s]ds  +  Wt 


(4) 
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where  a  Is  a  deterministic  function  and  V  is  a  standard  Wiener  process.  Such 
a  representation  for  Y  always  exists  [17]  if  the  likelihood  ratio  dPy/dP^ 
exists.  However,  (Z^ )  depends  (in  its  statistical  properties)  only  on  W.  In 
our  following  analysis,  the  noise  will  be  obtained  from  the  Wiener  process; 
that  is. 


"«  -  foF(t- 


s)dW 


(6) 


Thus,  if  we  were  to  use  this  representation,  then  we  would  have  (after 
filtering  to  obtain  Y  =  S  +  N)  an  S+N  process  which  is  completely  determined 
by  the  noise;  obviously  an  unrealistic  and  unsatisfactory  representation. 

However,  we  can  now  appeal  to  the  necessary  conditions.  According  to 
those  conditions,  we  also  have  the  representation  of  Y  given  by 


Yt  =  jV(t,s)dZ*  (7) 

where  F  is  as  above,  and  Z  has  the  representation 

Z*  =  jV(s,Zu,  u  $  s)ds  +  V*t  (8) 

where  now  W*  is  also  a  standard  Wiener  process,  but  is  a  functional  of  the  S+N 
process,  Y.  Moreover,  since  the  statistical  distributions  of  W*  and  W  are  the 

it 

same,  they  determine  the  same  probability  on  Cr0.11:  similarly  Z  and  Z  _ 
determine  the  same  probability  on  L^ro.ll.  We  can  therefore  assume  (7)  and 
(8),  rather  than  (5)  and  (6).  This  is  a  KEY  fact  in  determining  the 
discrete-time  detection  algorithm.  Without  this,  the  representation  derived 
in  the  next  section  would  be  fallacious,  since  it  would  ostensibly  be  based  on 
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a  S+N  process  that  depends  only  on  the  noise.  This  is  just  one  of  the  several 
subtleties  included  in  the  derivation  of  the  detection  algorithm  which  derive 
from  the  complicated  mathematics  of  the  continuous  time  problem,  and  which  are 
in  turn  so  often  dismissed  as  "just  mathematics." 

In  order  to  implement  the  likelihood  ratio,  it  is  clear  that  one  must 
know  the  likelihood  ratio  of  Z  (or  Z  )  with  respect  to  B,  and  also  the 
function  m.  The  likelihood  ratio  of  Z  with  respect  to  B  can  be  determined 
from  known  results,  albeit  in  functional  form.  Determination  of  the  unknown 
parameters  and  the  function  m  is  addressed  in  the  next  section. 

5.  Detection  Algorithms 

Summary.  A  discrete-time  algorithm  is  obtained  by  discretizing  the 
approximation  to  the  continuous- time  likelihood  ratio.  This  algorithm 
depends  on  the  covariance  matrix  of  the  noise  and  the  drift  function  of  a 
diffusion.  This  diffusion,  when  filtered,  gives  an  approximation  to  the 
S+N  process.  We  show  that  if  the  drift  function  is  known,  then  knowledge 
of  the  noise  covariance  matrix  enables  one  to  obtain  the  same  PFA  as  if 
the  (continuous-time)  filter  were  known,  while  the  PD  value  will 
approximate  the  value  that  would  be  obtained  if  the  filter  were  known 
(the  approximation  improving  as  sampling  rate  increases). 

The  preceding  results  are  all  for  the  original  continuous- time  problem. 
Although  they  are  very  general,  they  suffer  from  the  usual  shortcomings  of 
mathematical  research  devoted  to  existence  of  likelihood  ratios-'  they  are 
quite  useless  as  practical  detection  algorithms  unless  one  is  in  the  rare  (one 
may  say  nonexistent)  situation  of  having  a  reliable  and  complete  mathematical 
model  which  includes  all  the  parameters  needed  to  implement  the  continuous- 
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time  likelihood  ratio.  However,  they  form  a  basis  for  deriving  practical 
discrete-time  detection  algorithms  that  are  solidly  based  on  the  continuous¬ 
time  likelihood  ratio,  and.  in  particular,  which  satisfy  the  four  criteria 
mentioned  in  Section  3: 

(1)  Likelihood-ratio-based; 

(2)  Information-preserving; 

(3)  Imp lemen table;  and 

(4)  Adaptive  to  the  S+N  process. 

Two  algorithms  will  now  be  derived.  One  meets  all  four  of  the  above 
criteria.  The  other  meets  all  except  (4);  it  requires  prior  knowledge  of  the 
S+N  process.  However,  it  can  model  a  wider  class  of  S+N  processes  than  the 
adaptive  algorithm,  has  the  capability  of  being  relatively  more  easily 
implemented  in  recursive  form,  and  should  give  better  performance  in  many 
situations  when  the  necessary  prior  information  is  available. 

We  now  proceed  to  the  development  of  these  two  detection  algorithms. 
Version  I,  fully  adaptive  to  the  S+N  process,  is  based  on  the  following 
additional  assumptions: 

(A.l)  The  noise  process  has  multiplicity  M=l,  and  the  process  (B^(t))  is 
the  standard  Wiener  process  (W(t));  thus  N(t)  =  F(t,s)dW(s), 
where  F  is  a  Volterra  kernel  with  Jq  F^(t,s)dsdt  <  »; 

(A. 2)  The  signal -plus-noise  process  can  be  represented  as  Y(t)  = 

Jq  F(t,s)dZ(s),  where  the  process  (Z()  is  a  diffusion  with  respect 

A 

to  a  Wiener  process  W  and  has  time-invariant  drift  function,  so  that 

Z(t)  =  I*  0[Z(s)]ds  +  W(t) .  (9) 

J0 

where  P{u:  ,fQ0^[Z(w.s)]ds  <  «}  =  1. 
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The  second  algorithm.  Version  II,  assumes  (A.l)  above  and  that  the  signal- 
plus-noise  process  can  be  represented  as  Y(t)  =  J"q  F(t,s)dZ(s),  where 

(A. 3)  Z(t)  =  f  o[s.  Z(s)]ds  +  W(t),  (10) 

J0 

*  12  2 
where  again  W  is  a  Wiener  process,  and  P{<j:  SqO  [s,  Z(u,s)]  ds  <  «}  =  1. 

Assumption  (A.l)  can  be  justified  by  noting  that  all  stationary  processes 

and  all  discrete-time  processes  are  of  unit  multiplicity,  that  any  Gaussian 

vector  can  be  represented  as  the  result  of  a  lower-triangular  matrix  operating 

on  white  Gaussian  noise,  and  that  unit  multiplicity  processes  are  dense  (in 

LgCdPdt])  in  the  class  of  m.s.  continuous  processes  [19].  Moreover,  one  can 

show  that  the  assumption  (A.l)  is  satisfied  whenever  the  noise  process  has  a 

representation  N(t)  =  J*  F(t,s)dB(s),  where  the  variance  of  (B(t))  is  an 

absolutely  continuous  function  on  [0.1].  To  see,  this  we  note  that  an 

independent-increment  Gaussian  process  B  with  B(0)  =  0  (with  probability  one) 

2 

has  the  representation  B(t)  =  W[h(t)],  where  EB  (t)  =  h(t)  and  F(0)  =0.  In 
our  setup,  h  is  continuous,  non-decreasing,  and  we  can  assume  that  h(t)  >  h(s) 
if  t  >  s.  Thus, 

[^(t.sJdBfs)  =  fV^.sJdWCMs)]  =  P1  ^  ^F(  t  ,h~*(u)  )dW(u) 

0  0 

(setting  h(s)  =  u) 

=  p"  G( t ' ,u)dW(u) 

U 

where  t’  =  h  1 ( t ) .  G(t',u)  =  F(t,h  *(u)).  Note  that  if  u  >  t'.  then 
u  >  h  *(t).  so  h(u)  >  t;  hence  G(t'.u)  =  0  if  u  >  t’. 

To  clarify  the  slgnif icance  of  the  assumptions  (A. 2)  and  (A. 3),  we 
reference  well-known  material  on  the  representation  of  processes  (Z^)  such 
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that  the  likelihood  ratio  dP^/dP^  exists.  From  Theorem  7.11  of  [19],  any  such 
(Z^)  must  be  a  process  of  "diffusion  type”: 

Z(w.t)  =  f  t[s ,  Z(u.u).  u  i  s)]ds  +  W(w.t)  (11) 

J0 

A 

where  (Wt)  is  a  Viener  process  adapted  to  o(Z),  £ind  nr  is  a  function  on 
[0. l]xC0[0. 1]  such  that 

i)  for  all  s  in  [0,1]  and  all  functions  x  in  C[0,1],  -r(s.x)  does  not 
depend  on  x(t)  for  any  t  >  s; 

ii)  P{u-*  /Q-r2[s.Z(«.  #)]ds  <  •}  =  1. 

From  the  necessary  conditions  described  in  Section  4,  it  can  be  seen  that 
assumption  (A. 3)  reduces  to  the  assumption  that  the  function  nr  is  memoryless. 
Of  course,  there  is  a  very  large  class  of  processes  such  that  this  assumption 
is  satisfied.  Note  that  this  is  emphatically  not  equivalent  to  the  assumption 
that  the  observed  signal-plus-noise  process  is  the  solution  to  a  stochastic 
differential  equation.  To  be  precise,  given  that  assumption  (A.l)  is 
satisfied,  the  assumption  (A. 3)  states  that  the  signal-plus-noise  process  can 
be  represented  as  a  filtered  diffusion.  Note  that  (A. 3)  is  actually  an 
approximation  to  (11);  it  is  necessary  in  order  to  estimate  likelihood  ratio 
parameters  from  data. 

Assumption  (A. 3)  is  of  course  much  weaker  than  (A. 2);  the  latter  assumes 
not  only  that  the  process  Z  defined  above  is  a  diffusion,  but  that  the  drift 
function  is  time-homogeneous:  -r[s,  Z(u.u),  u  £  s]  =  0[Z(w,s)]  a.e.  dP(«)ds. 

The  reduction  of  the  problem  to  the  class  of  processes  satisfying  (A. 2) 
is  motivated  by  the  goal  of  developing  a  likelihood- ratio-based  detection 
algorithm  that  can  operate  without  any  prior  knowledge  of  the  signal 
properties:  completely  adaptive  to  the  S+N  process. 
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An  algorithm  will  now  be  described,  assuming  (A - 1 )  and  (A. 3).  and  uniform 
sampling  of  the  continuous- time  waveform.  For  the  detection  problem  as 
defined  above,  applying  the  results  for  the  likelihood  ratio  described  in 
Section  4,  and  known  results  for  the  Wiener  process  (see,  e.g. ,  [17]  or  [20]), 
the  general  form  (under  a  mild  restriction)  of  the  likelihood  ratio  on  I^O.l] 
Is 

[dPg+N/dPN](x)  =  lim  exp  [An(6n(m[x]))] 
n 

where  0  =  t”  <  t"  <  tg  <  ...  <  t”  =  T  is  a  partition  of  [0,T]  such  that 
sup  |tj+1  -  t ” |  -♦  0,  6n(x)  =  (x(t”),  x(t”) . x( ) •  and 

j 

n-1 

An[6  [m(x)]]  =  Z  cr(i.  m[x](t"))(m[x](t" )  -  m[x](t"))  (12) 

n  i=0 

-  (1/2)^  a2(i.  m[x](tj))(tj+1  -  t?). 
i=0 

m  is  defined  as  in  Section  4,  and  the  limit  exists  in  the  norm  of  Lj[P^]. 

It  should  be  noted  that  this  approximation  does  not  arise  from  sampling 
the  continuous- time  observation.  The  situation  is  much  more  complicated;  the 
approximation  is  obtained  by  sampling  of  the  continuous-time  function  m(x), 
where  x  is  the  observation.  m(x)  is  a  continuous  function  vanishing  at  zero. 
The  difficulty  is  that  m  will  generally  not  be  known. 

The  representation  of  (N(t))  by  N(t)  =  J*  F(t,s)dW(s)  yields  that 

||  || 

=  FF  .  where  F  is  the  integral  operator  with  F(t,s)  as  its  kernel,  and  F 

is  its  adjoint.  This  can  be  used  to  provide  an  approximation  to  the  function 

m(x)  appearing  in  (12)  that  does  not  require  calculation  of  eigenvalues  and 

eigenvectors. 

T  t 

First,  notice  that  <ej,ft>  =  Sq  Sq  F(s.u)du  ej(s)ds  = 

Jq  Jq  F(s.u)ej(s)dsdu  =  [LF*ej](t),  where  [Lf](t)  =  Sq  f(v)dv.  L  is  a  function 
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from  L^O.l]  into  C[0,1]  (the  continuous  functions  on  [0,1]).  The  expression 
for  m  given  in  [1]  can  then  be  written  as 


m[x](t)  =  lim  [LF* 
k-*» 


=  lim  [LrtjJ1?  x](t) 
k-*» 


(13) 


where  P^x  is  the  projection  of  the  function  x  on  the  subspace  in  LgCO.l] 

-1  *-i  -1 

spanned  by  (e^ . e^}.  Since  =  F  F  ,  the  preceding  becomes  m[x](t)  = 

11m  [LF_1P  x](t). 
k-w» 

For  almost  all  sample  functions  x  (either  from  noise  or 
signal-plus-noise),  m[x](*)  is  a  continuous  function  on  [0.1].  Thus  the  map  m 
is  a  linear  operator  from  ^[0.1]  into  C[0,1]  whose  domain  includes  (with 
probability  one)  all  sample  functions  of  the  noise  and  signal-plus-noise 
processes. 

The  difficulty  in  implementation  of  the  approximate  likelihood  ratio  (12) 
will  lie  in  Jetermining  the  function  a  and  linear  operator  m.  a  is  a  parameter 
of  the  signal -plus-noise  process,  and  its  estimation  is  a  problem  of 
considerable  interest  in  stochastic  processes  (as  the  drift  of  a  diffusion) 
and  in  stochastic  filtering.  The  possibly  unbounded  linear  operator  m.  mapping 
l^O.l]  into  C[0,1],  depends  only  on  the  covariance  function  of  the  noise.  If 
a  is  known  or  estimated,  and  if  the  noise  covariance  function  is  known  or 
estimated,  then  the  preceding  expressions  can  be  used  to  obtain  a 
discrete-time  finite-sample  approximation  to  the  likelihood  ratio.  We  now 
develop  such  approximations. 

Let  gJJ  denote  the  covariance  matrix  of  the  noise;  one  can  write 
a”f  F*.  where  the  matrix  F  is  lower  triangular  and  An  is  the  sampling 
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interval  (uniform  sampling).  The  expression  for  m  given  above  is  of  the  form 

m[x](t)  =  lim  [LF  ^.xjft). 
k-*»  K 

At 

where  R^  =  FF  ,  L  is  the  integration  operator,  and  is  the  projection  of  x 

onto  the  subspace  spanned  by  (e^ . e^} .  where  {en>  n^l)  are  o.n. 

eigenvectors  of  R^.  Thus,  a  reasonable  procedure  is  simply  to  replace  this 

expression  by  mn[xn]  =  L  F  *xn.  where  xn  is  the  observed  data  vector,  an 
—  — n- n  — 

element  of  IRn.  and  is  the  summation  operator  in  R0;  (L^x11^  =  xj • 

However,  this  is  so  far  simply  an  ad  hoc  assumption.  Thus,  we  now 

examine  the  relation  of  L  F  *xn  to  m(x),  where  xn  =  (x(An),  x(2An) . 

x(nAn)). 

As  discussed  in  Section  4.  (mo$^)(x)  =  x  a.e.  dP^(x) ,  where  $  maps  W  into 

N.  The  distribution  of  m(N)  is  accordingly  given  by  P^,  so  that  the  vector 

6n[m(N)]  =  (m(N)(An),  m(N)(2An),  ....  m(N)(nAn))  has  probability  distribution 

P^ofi^.  where  6^  C[0,1]  -*  Rn  is  defined  by  ^(x)  =  (x(An),  x(2An) . 

x(nAn)).  Similarly,  defining  mn(xn)  =  L^F^x11,  mn(Nn)  has  probability 

distribution  P-ofi  ,  from  the  definition  of  F  .  Thus,  in  the  preceding 
w  n  ~ n 

expression  (12)  for  An[6n[m(x)]],  and  setting  t^  =  iAn,  one  can  replace 
6n[m(x)]  by  mn[xn];  with  respect  tc  P^,  the  law  of  An[mn(xn)]  will  be  the  same 
as  that  of  An[6n[m(x)]] . 

Next,  suppose  that  the  signal  is  present.  In  examining  the  relation 
between  the  law  of  An[6n[m(x)]] ,  as  given  by  (12),  and  the  law  of  An[mn[xn]] , 
obtained  by  substituting  mn(xn)  for  6n[m(x)],  we  make  the  following  smoothness 
assumptions: 

(a)  F^i.J)  £  F(iAn,  jAn)  for  all  i.J  *  n; 

(b)  The  law  of  the  random  vector  in  Pn  with  (i+1)  component  given  by 
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^  n 

J<7[s,Z(s)]ds  +  W([i+1]A  )  is  approximately  the  law  of  the  random 
0 

vector  in  IRn  with  (i+1)  component  given  by 

An  Z  a[kAn.Z(kAn)]  +  W([i+l]An),  where 
k=l 

Z(t)  =  (i  o(s,Z(s))ds  +  f(t) 

J0 

as  in  assumption  (A. 3). 

Assumption  (a)  is  essentially  equivalent  to  assuming  that 


(lAj)A 

S 


n 

F(iAn.s)F(jAn,s)ds 


0 


.n 


Z  F(iA?  kAn)F(JAn.  kAn) 
k=l 


for  every  i,j  £  n.  It  is  thus  an  assumption  on  the  smoothness  of  {F(t.»).  t 
in  [0.1]}.  Note,  however,  that  the  smoothness  requirement  applies,  for  fixed 
t.  only  to  F(t,«)  restricted  to  [0,t].  Similarly,  (b)  amounts  to  a  smoothness 
assumption  on  a. 

When  signal  is  present,  letting  y11  be  the  vector  such  that  yn(k)  = 


J  F(kA  . s)dZ(s).  one  has 
0 


n,.  n-,  *  *."1  n 

m  [x  ]  =  L  F  y  . 


Assumptions  (a)  and  (b)  then  give  that  the  law  of  mn[xn]  is  approximately  that 
of  6n[m(x)].  with  the  law  of  6n[m(x)]  being  approximately  that  of  the  random 
vector  with  (i+1)  component  given  by 


An  >o[kAn.  Z(kAn)]  +  W((i+l)An) . 
k=l 
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Thus,  with  the  smoothness  assumptions  (a)  and  (b).  An[6n[m(x)]]  and 
An[mn[xn]]  have  approximately  the  same  distribution  w.r.t.  Pg+^-  From  the 
nature  of  assumptions  (a)  and  (b),  it  can  be  seen  that  (if  F  and  a  are 
sufficiently  smooth)  the  approximations  can  be  expected  to  become  better  (for 
a  fixed  observation  time)  as  n  increases  (An  decreases). 

We  thus  have,  under  the  assumption  that  a  is  known,  and  with  the 
smoothness  assumptions  on  F  and  a,  an  approximation  to  the  discretized 
continuous-time  likelihood  ratio.  The  probability  of  false  alarm  fPp^l 
calculated  under  this  approximation  will  be  exactly  that  which  one  would 
obtain  with  a  discretized  version  of  the  exact  continuous-time  likelihood 
ratio  (under  assumptions  A.l  and  A. 3).  Thus,  the  false  rlarm  rate  is  exact 
for  a  given  sampling  rate.  The  probability  of  dc  ection  (Pp)  will  be  an 
approximation  to  that  which  would  be  obtained  using  a  discretized  version  of 
the  exact  continuous-time  likelihood  <atio  when  A.l  ■**..,  a. 3  are  satisfied. 

In  most  applications,  of  course,  a  * i 1 1  not  be  known.  We  now  describe 
two  approaches  to  obtai.-Jng  an  estii.ate  of  a,  corresponding  to  the  two 
assumption  (  .2)  and  (A  ,  both  based  on  replacing  6n[m[x]]  with  mn[xn]  in 
tie  expression  (12)  for  the  discretized  log-likelihood  ratio. 

F„  st,  suppose  '  sumption  (A. 3)  is  satisfied,  and  that  a  training 
ensemble  of  representative  signal-plus-noise  data  is  available,  consisting  of 
K  vectors  (x*.  i  £  K),  each  having  n  components,  with  x*(j)  =  x1(jA).  It  is 
assumed  that  the  vectors  represent  independent  samples.  One  first  applies  the 
matrix  F^  to  each  element  of  this  ensemble.  Under  the  assumptions  (a)  and 
(b)  above,  this  gives  the  ensemble  of  vectors  {5Z*,  i  £  K) ,  where 

(^Hj)  =  z1[( j+l)An3  -  Z*[ jAn] 

=  Ana[j.  ZJ( jAn)]  +  W1[( j+l)An]  -  Wi[jAn3- 
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] 

I 

One  no*  fixes  j,  and  uses  the  K  values  having  the  above  expression  (i  £  K)  to 
estimate  o(j.*).  Various  procedures  can  be  used  to  carry  out  the  estimation; 
note  that  defining  (6W1)(J)  =  W1^ j+l)An]  -  W^JA®].  the  set  {(AW1)^).  i  £  K} 
consists  of  i.l.d.,  random  variables,  with  each  Gaussian,  mean  zero,  variance 
An. 

With  this  estimated  a  inserted  into  the  expression  (12)  for  An,  a  sample 
vector  x11  is  observed.  mn[xn]  is  then  formed,  and  used  in  the  expression  for 
An  to  form  the  test  statistic  An[mn[xn)]. 

If  a  representative  training  ensemble  of  signal-plus-noise  data  is 
available  from  which  one  can  estimate  a.  or  if  a  is  known  from  a  mathematical 
model,  then  the  above  procedure  gives  the  preferred  mechanization  (Version  II 
algorithm).  In  the  case  of  non-s tat ionary  signal  and  noise,  obtaining  an 
ensemble  of  S+N  data,  properly  aligned,  can  be  expected  to  be  difficult. 

However,  if  the  slgnal-plus-nolse  is  a  stationary  process,  then  one  may  opt  to 
use  a  long  segment  of  S+N  data  to  estimate  a  time-invariant  a;  this  segment 
could  then  be  much  longer  theui  the  observation  time  over  which  the  detection 
algorithm  is  to  perform.  It  can  be  shown  [3]  that  use  of  a  time-varying  a 
gives  an  exact  likelihood  ratio  for  the  discrete-time  problem  if  the  signal- 
plus-noise  process  is  Gaussian. 

•t 

Suppose  next  that  nothing  is  known  about  the  statistical  properties  of 
the  slgnal-plus-nolse  process,  and  that  an  ensemble  of  training  data  is  not 
available.  Version  I  of  the  algorithm  (12)  is  then  Implemented  as  follows, 
for  a  fixed  value  of  n:  The  observed  sample  vector  xn  is  first  used  to 
estimate  a  time-homogenous  o;  the  estimated  a  is  inserted  into  the  expression 
(12)  for  An,  and  then  An[mn(xn)]  is  evaluated.  This  version  of  the  algorithm 
corresponds  to  assumption  (A. 2). 
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The  estimation  of  a  is  made  from  the  single  observed  sample  vector  xn 
which  is  to  be  tested  for  the  presence  of  a  signal.  One  applies  to  xn; 
assuming  that  x11  represents  signal-plus-noise,  this  yields  a  vector  6Z,  which 
has  the  representation  (under  assumptions  (a)  and  (b)) 

(6Z)(j+l)  =  Ana[Z(JAn)]  +  (6W)( j+1) , 

(6W)(j+l)  =  W[(j+l)An]  -  W[jAn]. 

The  elements  of  (5W(j).  j  £  K}  are  i.i.d.  random  variables,  with  each 
Gaussian,  mean  zero,  and  variance  An. 

The  preceding  discussion  will  now  be  summarized.  First,  a  is  either 
known  or  else  is  estimated  by  one  of  the  two  procedures  described  above  when 
the  observation  is  an  n-component  vector  xn.  The  test  statistic,  an 
approximation  to  the  continuous- time  log-likelihood  ratio  under  the  assumption 
that  the  noise  has  multiplicity  M  =  1.  then  has  the  expression  (with  the 
definition  of  An  slightly  changed  from  (12)) 

-  (An/2)  "y  a2(J.  [(LnE^1xn)J]  (14) 

If  now  a  new  data  point  xr+j  is  observed,  the  approximation  has  the  recursive 
form 

An+V+1)  =  An(!S)  *  »[n.  (LT'xytE^x)^  (15) 

-  (An/2)  o2[n.  (L  F_1x)n]. 
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The  above  procedure  requires  relatively  few  additional  operations  when  a 
new  data  point  is  observed.  The  implementation  and  calculation  of  A  require 
the  following  operations.  First,  the  function  a  must  be  known  and  programmed 
or  estimated  from  the  observation.  Given  the  value  of  An(xn)  and  the  observa¬ 
tion  xn  =  (x- . x  ).  one  stores  An(xn).  xn.  o[n.  (L  F  *xn)  1.  and 

l  n  n-  n  -  n 

(LJF^*xn)^.  When  the  data  point  *n+j  Is  received,  it  is  only  necessary  to  use 
the  vector  xn+*  to  calculate  (F^^x11*^ )  ^ .  which  is  simply  cross-correlation 

of  the  observation  vector  xn+*  with  row  n+1  of  F^j.  This  number,  say  bn+1>  is 
then  used  to  form  An+^(xn+^), 


An+1(xn+1)  =  An(xn)  +  a[n,  2  b.]b  -  (An/2)  o2[n.  2  bj.  (16) 

1=1  i=l  1 

Throughout,  we  have  made  the  assumption  that  the  noise  covariance  is 

known.  One  then  knows  (F^.  n  >  1},  and  thus  (F^*.  n  £  1).  As  mentioned,  each 

new  observation  of  a  data  point  requires  only  cross-correlation  of  the 

observed  vector,  an  element  of  tT1.  with  row  n+1  of  F  It  is  not 

- n+l 

necessary  to  apply  the  matrix  F^j  to  the  data  vector. 

Implementation  of  the  recursive  form  of  the  algorithm  is  done  most 
conveniently  when  a  is  known,  or  when  a  training  ensemble  of  S+N  data  can  be 
used  to  estimate  a.  If  one  must  estimate  a  from  the  observed  data  (Version  I 
algorithm),  then  the  recursive  formulation  given  above  will  need  modification. 
Various  approaches  can  then  be  used  for  updating  the  estimate  of  a,  depending 
on  the  amount  of  storage  available,  etc. 

A  pleasing  and  important  feature  of  the  algorithms  in  that  Version  II  is 
a  likelihood  ratio  (exact)  when  the  S+N  process  is  Gaussian  (see  [3)). 

The  performance  of  the  algorithm  can  be  expected  to  depend  not  only  on 
the  properties  of  the  data,  but  also  on  the  sampling  rate  and  the  choice  of 


Opt. De tec.  -  7/6/92  -  28 


the  specific  estimation  procedure  for  estimating  a.  Thus,  implementation  for 
a  particular  application  should  be  preceded  by  an  extensive  study  featuring 
both  simulated  and  experimental  data.  Limited  computational  evaluations  of 
these  algorithms  have  given  excellent  results  with  passive  sonar  data  [21], 

6.  Concluding  Remarks 

Many  discrete-time  finite-sample  detection  algorithms  for  problems  such 
as  discussed  here  are  obtained  from  consideration  of  only  the  discrete-time 
(and  finite-dimensional)  problem.  It  is  obviously  preferable,  if  possible,  to 
develop  a  discrete-time  algorithm  based  on  approximations  to  the  likelihood 
ratio  for  the  continuous- time  problem. 

However,  likelihood  ratios  for  continuous-time  problems  involve 
conditions  for  existence  of  the  likelihood  ratio.  Unfortunately,  studies  on 
existence  of  likelihood  ratios  are  frequently  considered  to  be  of  only 
mathematical  interest.  As  can  be  seen  from  the  preceding  development, 
appropriate  studies  on  absolute  continuity  and  likelihood  ratios  for 
continuous-time  problems  can  be  extremely  important  in  developing  practical 
discrete-time  detection  algorithms. 
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§27@  A  method  of  assimilating  3-hourly  sounding  data  is  developed  and 

successfully  tested  in  this  study.  First,  the  successive  corrections 
scheme  of  Bratseth  (1986)  which  converges  to  optimum  interpolation,  is 
applied  for  the  numerical  analysis  of  data  collected  during  the  Genesis 
of  Atlantic  Lows  Experiment  (GALE).  Next,  diabatic  forcing  is 
incorporated  into  a  vertical  mode  initialization  scheme  to  proivde  more 
realistic  initial  conditions  and  to  shorten  the  spinup  time  of  the  Naval 
Research  Laboratory /North  Carolina  State  University  (NRL/NCSU)  mesoscale 
mode.  Latent-heating  profiles  are  computed  from  'spun-up' 
model-generated  and  observed  rainfall.  Finally,  the  multivariate, 
successive  correction  analysis  scheme  correction  analysis  scheme  and  the 
diabatic  initialization  procedure  are  combined  with  the  NRL/NCSU  model 
to  form  an  intermittent  data-assimilation  system.  Assimilations  of  the 
GALE  data  over  a  2  1/2-day  period  were  performed  with  differing  update 
cycles  of  3,  6,  and  12  h. 
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